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This work reviews several hierarchical measurements of the topology of complex networks and 
then applies feature selection concepts and methods in order to quantify the relative importance of 
each measurement with respect to the discrimination between four representative theoretical net- 
work models, namely Erdos-Renyi, Barabasi- Albert, Watts-Strogatz as well as a geographical type 
of network. The obtained results confirmed that the four models can be well-separated by using 
a combination of measurements. In addition, the relative contribution of each considered feature 
for the overall discrimination of the models was quantified in terms of the respective weights in 
the canonical projection into two dimensions, with the traditional clustering coefficient, hierarchical 
clustering coefficient and neighborhood clustering coefficient resulting particularly effective. Inter- 
estingly, the average shortest path length and hierarchical node degrees contributed little for the 
separation of the four network models. 



PACS numbers; 



I.75.HC, 89.75.Fb, 89.75.-k 



It is better to know some of the questions than all of 
the answers. (J. Thurber) 



I. INTRODUCTION 

A relevant analysis of several features of complex sys- 
tems can be achieved through the recently developed 
complex netvirork framework (e.g. [H, [3, H, H, §)■ Due 
to the large amount of variables normally involved in 
such dynamical systems, the set up of a interaction net- 
work, based on functional relationship among its degrees 
of freedom, offers a first picture to the actual internal 
structure of the system. This process requires the iden- 
tification of the pertinent degrees of freedom as nodes, 
while the edges that connect them are defined by the 
mutual influence they are subject to. The ability of iden- 
tifying nodes and edges in the appropriate way is a crucial 
step in this modeling. 

The characterization of the so obtained networks con- 
stitute a second important step in this kind of analysis. 
In this process, a small number of features is chosen in or- 
der to measure, in an objective way, pertinent properties 
of the sets of nodes and edges. The choice of measure- 
ments used in the investigation constitutes the second key 
decision during the structural analysis of the networks, 
as it defines which information can be obtained (e.g. [H])- 
Nowadays, it is consensual that the set of basic measures 
include the average number of links per node(fc), the clus- 
tering coefficient C, mean minimal distance among the 
nodes (d), and the network diameter D. However, such 
measurements can not provide a one-to-one characteri- 
zation of the networks, i.e. they yield only a degenerate 
representation from which the original network can not 
be recovered. This is because several distinct networks 
may be mapped into the same set of measurement values. 



Therefore, new and distinct measures have been proposed 
in order to capture new aspects not covered by the set of 
four parameters listed above. 

One particularly important aspect regards the char- 
acterization of individual nodes in the network, as this 
allows the identification of particularly distinct nodes 
such as hubs (e.g. 0]. While both the degree and clus- 
tering coefficient are defined for each individual node, 
they provide but a limited characterization of the con- 
nectivity around those nodes, with several nodes result- 
ing with identical pairs of degree/clustering coefficient 
values even when they are placed at completely different 
contexts in the network. One interesting means to ob- 
tain a richer (i.e. less degenerate) set of measurements 
for each node is to consider subsequent hierarchical neigh- 
borhoods (e.g. 0, 1 i, HH, [11 [H, [11) around each 
node, in addition to the immediate neighbors considered 
in the traditional degree and clustering coefficient. 

Although new measurements provide extra informa- 
tion, it is important to understand how they are re- 
lated to those in the basic set and among themselves. 
If one defines a measure space whose axes are spanned 
by the distinct parameters, one important issue regards 
the distribution of observations along each axis. A full 
answer to such task should include also an analytical re- 
lationship between the co-linear (correlated) measures. 
Another related issue refers to deciding, given a set of 
distinct measures, which of them are most effective in 
identifying and discriminating between distinct kinds of 
networks. The purpose of this work is to address these 
questions, by working with a set of hierarchical measures 
and using sound concepts and methods of multivariate 
statistics (e.g. d, [H, [ll]). We probe a large number 
of networks generated according to four representative 
theoretical models, Erdos-Renyi (ER), Barabasi- Albert 
(BA), Watts-Strogatz (WS) as well as a geographical 
type of network (GG), which can be put in connection to 
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distinct complex network paradigmatic types, displaying 
feature that are associated to the random, scale-free and 
small-world behaviors. In this way, it becomes possible 
to quantify the relative importance of each measurement, 
with respect to the discrimination between the considered 
distinct network models. Although illustrated for these 
specific four types of networks, the reported methodol- 
ogy is completely general and can be applied virtually to 
any problem involving the choice of measurements given 
specific types of theoretical or real- world networks. 

The hierarchical measurements we take into account 
have been discussed in a series of previous investigations 
(e.g. [m [3 ) ! i'^ which the authors have inquired 
how a node sees not only its immediate neighborhood, 
but also successive neighborhoods up to a maximal dis- 
tance D from the reference node. The concepts of hierar- 
chies and higher order neighborhoods, that have been in- 
dependently introduced (e.g. 0, i, i, Ell, El El El ) , 
aim at providing a description of the relationship among 
given sets of nodes which are not necessarily linked by im- 
mediate edges, but for which the minimal distance along 
the network is bound to a value 1 < i < D. £ dependent 
clustering coefficients and node degrees have been investi- 
gated and compared for several sets of networks. In a sec- 
ond line of investigation, a recent contribution raises the 
issue of the interdependence among distinct measures, 
while reviewing the most relevant measures that have 
been introduced so far 5]. The results reported herein, 
heavily based on the ideas developed in the quoted ref- 
erences, are aimed at quantifying the role of the several 
hierarchical measurements while discriminating between 
the four considered theoretical network models. In or- 
der to quantify the influence of each measurement on the 
separation between the four classes of networks, we apply 
sound and objective concepts from multivariate statis- 
tics, namely standardization and canonical projections 

(e.g. UM)- , 

This work is organized as follows: In Section II, we 
present the basic notions of complex networks and of the 
theoretical models used in our investigation. In Section 
III, we discuss the hierarchical measurements that will 
be taken into account for the selection method. These 
methods are presented and discussed in Section IV. Re- 
sults from our analyzes are discussed in Section V, while 
Section VI closes the work with the concluding remarks. 



II. BASIC CONCEPTS 

This section introduces the main concepts used in our 
analysis, including network representation as well as the 
four theoretical models. 



A. Complex Networks Basic Concepts 

A non-weighted complex network T with N nodes and 
E edges can be fully specified in terms of its adjacency 



matrix K, so that K{i,j) = 1 indicates the existence of 
an edge extending from node j to node i. All networks 
considered in this work are undirected, which implies K 
to be symmetric. They are also devoid of multiple or self- 
connections. Networks whose nodes have well-defined 
spatial positions within an embedding space are called 
geographical networks. 

The degree ki of a node i is defined as the number of 
edges connected to it. In case a node j can be reached 
from a node i, we can say that there is a path between 
these two nodes. Two nodes can be connected through 
more than one distinct path. The shortest path dij be- 
tween two nodes i and j corresponds to the path with the 
smallest number of edges connecting those nodes. The 
immediate neighborhood of a node i is the set of nodes 
which are directly connected to i, i.e. the nodes j for 
which dij = 1. The average shortest path (di) to a node 
i is the mean value of di,j over all nodes i ^ j, while the 
network average shortest path (rf) is obtained by tak- 
ing the mean value of {di) over the whole set of network 
nodes. 

The clustering coefficient C'i of node i can be calcu- 
lated as the ratio between the number of edges among 
the immediate neighbors of i and the maximum possible 
number of edges between those nodes. Although mea- 
surements such as the node degree and clustering coeffi- 
cient apply to individual nodes, it is common to take their 
average along the network, yielding the average node de- 
gree (k) and the average clustering coefficient C. 



B. Complex Networks Models 

Several theoretical models of complex networks have 
been proposed (e.g. P, H 0, 0, Q). As announced in 
Section I, the current work considers four of such models 
(ER, BA, WS and GG), the most important features of 
which wc briefly describe below. All networks used for 
the comparison of the hierarchical measurements in this 
work have the same number of nodes N and average node 
degrees (fc) as similar as possible. 

The ER model (e.g. [l|) is characterized by having 
constant probability p of connection between any pos- 
sible pair of nodes. Its average degree is given as (k) — 
2E/N = 2{N - l)p. The BA model can be obtained 
by starting with randomly interconnected mO nodes. At 
each subsequent step, a new node is connected to m nodes 
in the current network such that each connection is pref- 
erential to the degree of the previous nodes. The average 
degree of a BA model is given as 2m. Therefore, in order 
to have ER and BA networks with the same node degree, 
we need to enforce that m = {N — l)p. WS networks can 
be produced by starting with the N nodes distributed 
along a ring and connecting each node to its {k) /2 clock- 
wise neighbors and to the same number of counterclock- 
wise neighbors, with (fc) being an even number. Then, 
a small percentage of edges are randomly rewired. Fi- 
nally, the geographical model considered in this work is 
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obtained by considering a Poisson spatial distribution of 
points with density 7 in a two-dimensional embedding 
space with uniform connecting all pair of nodes w hich 
are at Euclidean distance smaller than ^ (fc) / (77r). 

III. HIERARCHICAL MEASUREMENTS 

Two of the most ubiquitously accepted network mea- 
sures, namely the average number of links per node(/c) 
and the clustering coefhcient C, reflect the immediate 
landscape of the nodes, as they just consider, respec- 
tively, the number of neighbors each node is connected 
to by a direct edge, and how the neighbors of a node 
are connected among themselves. The hierarchical mea- 
surements introduced in [l^, [HI, [H, [H, [i3| first require 
the identification of the sets named the hierarchical shells 
Hi{£) or, alternatively, the neighborhoods Ni{£), of order 
£ of a node i as the nodes that lie at a minimal distance 
£ along the edges of the network of a given node i. For 
the sake of uniqueness, from now on we call these sets 
as Hi{£). The hierarchical measurements result from the 
extension of the two basic concepts to the sets Hi{t). 

For the feature selection analysis we consider, respec- 
tively, two and three distinct types of node degrees and 
clustering coefficients, which are so defined. The average 
degree 

{m)^j2h{£), (1) 

i=l 

where ki {£) counts the number of neighbors which are at 
a minimal distance £ of node i, indicates how the higher 
order neighborhoods of each node are populated. The 

average hierarchical degree 

n 

(2) 

has a different meaning, as kf{£) counts the number of 
links between elements of the two sets Hi{£) and Hi{£+1). 
It expresses how deep connected are the nodes that lie in 
two successive hierarchical shells, namely £ and £ + 1, of 
node i. Observe that we have {k{£ ^ 1)) = {k" {£ = 0)) ^ 
(k) , if we consider that the 0-th order neighborhood of a 
node is the node itself. 

The three distinct £ dependent clustering coefficients 
coincide with the usual C when £ ^ 1. The hierarchi- 
cal clustering coefficient C^{£) counts how many of the 
ki{£){ki{£) — l)/2 pairs of nodes formed the elements of 
the set Hi{£) are directly linked by one edge. On the 
other hand, the neighborhood clustering coefficient {£) 
takes into account those pairs of the same set that are 
neighbors of order £. The original clustering coefficient 
C is a direct measure of the presence of nearby triangles 
in a network, and it indirectly hints to the presence of 
connected structures as cliques. The higher order C^{£) 
and {£) give information on the how the nodes on 



more distinct hierarchical shells are related among them- 
selves. 

Finally, the hierarchical clustering coefficient by balls 
C^(£), which was also previously introduced, constitutes 
the third measurement that takes into account the hi- 
erarchical structure of neighbors of a node. This is a 
cumulative measure in the sense that, instead of consid- 
ering the nodes in a single set Hi {£) , it considers all nodes 
in the set Sjr{£) = [j\=^H,{£). 

To evaluate all the above hierarchical measures we 
profited from the formalism introduced in [l3 |. which 
amounts to first identifying all the higher order neigh- 
borhoods of the networks and storing the information in 
a single matrix 

D 

m = ^£M{£). (3) 

All distinct hierarchical measures can be easily defined 
in terms of the elements of M. 

IV. FEATURE SELECTION METHODS 

Given L classes of networks (in the case of the cur- 
rent article the four theoretical models ER, BA, WS and 
GG) and Q respective measurements of their topology, 
an important question is: which subset of measurements 
is more effective for discriminating between such classes? 
Such a problem provides a good example of feature se- 
lection. 

Two main approaches have been considered for fea- 
ture selection: filter and wrapper. The difference be- 
tween these two families of methods is that the latter 
evaluates the features by considering the results obtained 
after feeding them into a classifier, while the former 
methods investigate the intrinsic relationship between 
the measurements between and/or within the classes 
(e.g. [H, [13, nil))- For instance, the canonical projec- 
tion method used in this work provides an example of a 
ffiter approach to feature selection. 

It should be observed that none of the feature ap- 
proaches are absolutely optimal. While wrapper meth- 
ods will select features which are most effective for given 
classifiers, filter approaches will depend on the definition 
of some optimality criterion. For instance, the canonical 
projection method adopted in this work quantifies the 
separation between the classes by maximizing the dis- 
tance between the classes and minimizing the dispersion 
inside each class (see Section HV C[) . Because our interest 
in the current work is to characterize the discrimination 
power of the several hierarchical measurements, we limit 
our attention to filter feature selection methods. 

The following subsections present the basic concepts 
from multivariate statistics as well as the principal com- 
ponent analysis and the canonical projection methodolo- 
gies. 
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A. Basic Concepts in Multivariate Statistics 

Let each of the Q objects of interest (e.g. networks) 
be characterized in terms of R measurements x{i), i = 
1,2, . . . , R. It is convenient to organize the set of mea- 
surements obtained for each object p — 1,2,. . . ,Q into 
the respective feature vector 

Vp = [xp{l),Xp{2),...,Xp{R)f . (4) 
The mean feature vector fl can be calculated as 



K^) = Q^^Xpi^)- (5) 

The elements C{i,j) of the covariance matrix C of the 
measurements of the objects can be estimated as 



c{i,j) = ^ Y.^xp{i) - t^mxpij) ~ ^i(J)) (6) 

The standardized feature vector can be obtained as 

^ ^ \ xp{l)-^l[l) Xp{2)-fi{2) Xp{R)-fi{R) V 

[ ^(1) ' ^(2) <7{R) _ ' 

(7) 

where a{i) is the standard deviation of measurement x{i). 
Note that each normalized measurement s{i) has zero 
mean and unity standard deviation. 

The Pearson Correlation Coefficient between two mea- 
surements x{i) and x{j) can be given by the covariance 
between the standardized measurements s{i) and s(j). 

B. Principal Component Analysis — PCA 

The multivariate statistical method known as principal 
component analysis (PCA) allows dimensionality reduc- 
tion while maximizing the data variance along the first 
projected axes (e.g. [sTflsLfTsjl. Because the class of each 
point is not taken into account in this method, it cor- 
responds neither to filter nor wrapper feature selection. 
This method is considered in this work for two reasons. 
First, it can be used to obtain preliminary visualizations 
of the distribution of points and classes. Second, it pro- 
vides an introduction and a comparison standard to the 
more sophisticated canonical projections methodology, to 
which it is related. 

Given the set of Q objects, characterized by R mea- 
surements, it is possible to project such measurements 
into a reduced space with W < Q dimensions. In order 
to do so, the covariance matrix C of the measurements 
is estimated as described in Section IIV Al and its eigen- 
values and respective eigenvectors are calculated. The 



eigenvectors corresponding to the W largest eigenvalues 
(in decreasing order of absolute values) are organized into 
a matrix A such that each line corresponds to an eigen- 
vector. The matrix A defines the statistical linear trans- 
formation of the original set of data that maximizes vari- 
ances along the first new axes. Provided W is equal to 2 
or 3, the so-transformed data can now be visualized as a 
2D or 3D distribution of points. The original classes of 
each point can be visualized with different marks. 

C. Canonical Projections 

The method of canonical projections, also called 
canonical analysis or canonical variables (e.g.[l^ ll^). 
also performs a projection of the original measurement 
space, but now considering explicitly the original classes 
of each object. The projection is performed not in order 
to maximize the variances along the first new axes, but 
so as to obtain maximum separation of the classes, quan- 
tified by an optimality index f reflecting the distribution 
of the data both inside and among classes. More specif- 
ically, ^ will favor well-separated classes, with small dis- 
persions of the respective objects. The inter- and intra- 
class dispersion matrix, respectively and Da, can be 
calculated as described in 0, [l^, [l3| ■ The eigenstructure 
of the matrix {Da)~^De provide the basis for the sought 
linear transformation (as in the PCA, the eigenvectors 
associated to the largest absolute eigenvalues are stacked 
as lines in the transformation matrix) maximizing the 
separation between the classes. For instance, in the case 
of canonical projections into two-dimensional spaces, the 
eigenvectors vl and v2 associated to the largest and sec- 
ond largest absolute eigenvalues are used to define the 
projection linear transformation. Similarly to the PCA, 
the contribution of each original measurement to the pro- 
jection can be quantified in terms of the absolute value of 
the weights defined by the respective eigenvector trans- 
formation matrix. Therefore, the measurements yield- 
ing the largest absolute weights for the first axes can be 
understood as those which are more important for the 
separation between the classes. In this work we define 
the importance of each measurement i as the sum of the 
absolute values of the respective weights in i;l and v2, 
i.e. 

m^\vli^)\ + \v2i^)\ (8) 

In order to avoid intrinsic biases implied by the relative 
amplitude of each measurement, it is interesting to per- 
form the canonical projections on standardized versions 
of the measurements. 



V. RESULTS AND DISCUSSION 

In order to investigate, in a comparative fashion, the 
relative contributions of each measurement for the char- 
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acterization and discrimination between the four con- 
sidered complex network models, 30 realizations of each 
model, all with mean degree equal to 6 and sizes N of 
100, 200 and 300 nodes, were first obtained. Two types of 
WS networks were obtained, considering O.IA^ and O.IE 
connection rewirings, where E is the overall number of 
connections. These two types of WS networks are hence- 
forth abbreviated as WS-R and WS-S. Observe that the 
former type corresponds to almost regular networks (i.e. 
similar node degrees throughout), while the latter type 
presents the small world property. All considered models 
had their average traditional and hierarchical measure- 
ments (for £=1,2, and 3) calculated and used as feature 
vectors. Table U lists the considered measurements as 
well as their respective symbols and abbreviations. 



Measurement 


Symbol 


Abbrev- 






iation 


Hierarchical clustering coefRcient by balls 


C^(£) 


cb 


Hierarchical clustering coefficient 




cl 


Neighborhood clustering coefficient 




cn 


Average number of nodes 


m) 


n 


Average shortest path 




sp 


Average hierarchical degree 


(fc^(£)> 


hd 



TABLE I: The symbols and abbreviations of the considered 
hierarchical measurements defined in Sections II and III. The 
respective hierarchical level {£) is henceforth represented in 
front of each abbreviation, e.g. the hierarchical node degree 
at level ^ = 3 is abbreviated as hd3 



Figure [T] shows the two-dimensional phase spaces ob- 
tained by PCA projection of the original 13-dimensional 
phase spaces for N = 100 and 300. Each point in this 
phase space corresponds to a specific network realization. 
The ER and BA clusters resulted near one another, which 
was also obtained for the WS/GG pair of clusters. The 
GG networks resulted in the most dispersed cluster. 

The phase spaces obtained while considering all 13 
measurements were also projected into two dimensions 
by using the canonical methodology described in Sec- 
tion IIVGI Figure [5] shows the projected phase spaces 
obtained for networks with size of iV = 100 (a) and 300 
(b) nodes, respectively. It is clear that the separation 
between the four networks modes is much better than 
that obtained by using PCA (Fig. [J). It is also clear 
from the two dimensional spaces in Fig. [2] that the four 
models could be very well separated as a consequence of 
using such a comprehensive set of features. Interestingly, 
the ER/BA and WS/GG models again tended to clus- 
ter together. Observe that the dispersion of the points 
for all distinct classes decreased for larger N. The rel- 
ative separation between the ER and BA models along 
the V2 direction also decreased for this case. Also, the 
WS-R and WS-S families of networks resulted near one 
another. 
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FIG. 1: The distribution, in the PCA projected phase space, 
of the four theoretical complex network models obtained for 
networks with 100 (a) and 300 (b) nodes. The axes pi and 
p2 correspond to the two main projection orientations as pro- 
vided by the eigenvectors associated to the largest absolute 
eigenvalues of the covariance matrix. 



The seven most important measurements considering 
all the three network sizes, in decreasing order, were: 
cbl = chl = cnl ~ C, c53, cn2, cn3, c52, c53 and hd2. 
Figure [3] shows the importance of each of these measure- 
ments obtained for each of the network sizes (i.e. 100, 
200 and 300). It is noteworthy the absence of two of the 
most used measurements, ((fc) and (d), as well as of the 
higher order hierarchical node degrees, for the purpose 
of identifying the distinct network classes. The fact that 
all networks considered in this work had nearly the same 
average degree explains why (k) had little contribution 
for the discrimination. However, the fact that the distri- 
bution of average hierarchical degrees have been found 
to vary between different network models [l2| should, at 
least in principle, imply better discrimination potential 
for those measurements. The relatively minor contribu- 
tion provided by the average shortest path is also sur- 
prising. 

The traditional clustering coefficient, C resulted the 
most important measurement in all cases, but the rele- 
vance of considering higher hierarchies for the character- 
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FIG. 2: The distribution, in the projected phase space, of the 
four theoretical complex network models obtained for net- 
works with 100 (a) and 300 (b) nodes. The axes vl and v2 
correspond to the two main projection orientations as pro- 
vided by the eigenvectors associated to the largest absolute 
eigenvalues. 



ization of the considered networks was corroborated by 
relatively high importance obtained for the other mea- 
surements. Interestingly, the importance of the cluster- 
ing coefficient by balls for hierarchy 3 tended to increase 
with N, while the neighborhood clustering coefficient for 
hierarchy 3 decreased with that parameter. The former 
effect is a consequence of the fact that the hierarchical 
degree becomes more relevant in larger networks, because 
such networks have larger diameter and therefore allow 
more elaborated and unfolded neighborhoods. 

Figure |4] illustrates several scatterplots obtained con- 
sidering pairs of the adopted measurements. Fig- 
ures|li;a,b), corresponding to C^(2) x C^(2) and k"{2) x 
fc^(2), show that these two measurements provided good 
discrimination between the four network models. For 
other measurements (not shown), we can observe the ex- 
istence of fewer distinct disjoint regions. The other scat- 
terplots in Figure S] correspond to pairwise associations 
between distinct measurements. For instance, the panel 
C^(2) X C^(3) (Fig. HI:) has most of the points aligned 
along the diagonal, indicating that these measures pro- 
vide almost the same kind of information (statistically, 



1.0 
0.8 
0.6- 
0.4 



cbl 



cbl cb3 cn2 cn3 hi2 



hit! 



FIG. 3: The importance of the most relevant measurements 
for each of the considered three network sizes, i.e. 100, 200 
and 300 nodes. 



the two measurements are said to be correlated). A simi- 
lar tendency was observed for the same hierarchical mea- 
surements with distinct values of ^, as in the panels for 
C^(2) X C^(3) (Fig.Hi) and {k"{2)) x (fc^(3)) (Fig.S;). 
Note, however, that the points in these cases are aligned 
in a less clear way in comparison to C^(2) x C^(3) 
(Fig. |4);). The panels which combine hierarchical mea- 
sures of distinct classes (C^, {k^),C^) have the points 
away from the diagonal (e.g. Fig.[4^-h). This is a indica- 
tion that these measurements are uncorrelated, tending 
to provide non-redundant information and, consequently, 
enhanced discrimination power. It is however important 
to stress that the overall discrimination can not be fully 
predicted simply from pairwise relationships between the 
measurements, such as those illustrated above. Observe 
also that none of the two-dimensional scatterplots in Fig- 
urelUprovide separation between the four models as good 
as that shown in Figure That is because the latter 
scatterplot was obtained from the much higher dimen- 
sional phase space by projecting into the plane allow- 
ing the best separation between the four models. Such 
a result clearly corroborates the increased separability 
allowed by the consideration of a comprehensive set of 
distinct hierarchical measurements. 

Finally, we proceeded with a further test to investigate 
the ability of the used feature section methods to uncover 
network specificities, by adding one extra network to two 
of the previous groups of 120 specimens. We considered 
the Apollonian network (AN) [2lj , a geometrical as- 
sembly of nodes and links, which is defined on the basis 
of the classical problem of finding the optimal covering of 
a plane by circles. As they are defined in a recursive way, 
we considered two successively AN generations, respec- 
tively with N = 124 and 367 nodes, and included them 
into the groups consisting of networks with N — 100 and 
300. It is important to recall that AN shows several fea- 
tures that are typical both of small world (small D and 
(d)), and scale free scenarios {p{k) ~ k~'^). The fea- 
ture selection method can help to identify whether AN 
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FIG. 4: Scatterplots respective to pairwise combinations of the adopted measurements for N = 300. Measurements cn2 and hd2 

provided the best discrimination between the network models when taken individuaUy (a,b) . Measurements taken at successive 
hierarchies tended to be moderately correlated (c), while distinct types of measurements presented very little correlation (d-h). 
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lies closer to the WS or BA clusters and, hence, to in- 
dicate which of the quoted scenarios it stays closer to. 
The results {vl,v2) = (-0.90,2.53) for N = 124, and 
(-0.24, 0.96) for N = 367, indicate that, for both gen- 
erations, the AN is mapped away from all four clusters 
corresponding to each of the considered theoretical mod- 
els. This is, indeed, a very interesting result, as it shows 
that our method is able to identify that this type of net- 
work presents quite distinct topological features, albeit 
it shares some common properties with those that fit 
into those the small world and scale free scenarios. In 
other words, the very fact that several features of a given 
network coincide with those typical for a large network 
class, does not automatically implies that it belongs to 
the same set. 



VI. CONCLUDING REMARKS 

Much of the advances in science have only been al- 
lowed by ability of researchers to focus attention on the 
most important features and variables in each problem. 
Because human beings have a rather limited ability to 
cope with a large numbers of measurements, it becomes 
critical to devise and apply methods which can possibly 
identify the most relevant features. Fortunately, sound 
and objective concepts and methods — defining the re- 
search area called feature selection — have been devel- 
oped which can help us in such tasks. Perhaps for histor- 
ical reasons, such methods are not so widely known and 
used by the Physics community. 

Because highly structured complex networks can only 
be comprehensively characterized by considering several 
measurements, the application of feature selection meth- 
ods presents great potential for helping researchers in 
that area. In a recent work canonical variables 

projections and Bayesian decision theory were applied 
in order to classify complex networks and to investigate 
measurements. The current work has unfolded such a 
possibility with respect to the discriminative potential of 
a comprehensive set of hierarchical measurements. 

While the traditional node degree, clustering coeffi- 
cient and shortest path provide quantifications of im- 
portant features of the networks under analysis, they 
are degenerated in the sense that several networks may 
map into the same measurement values. The exten- 
sion of such concepts to reflect also the progressive 
neighborhoods around each node has been proposed 
(e.g. El, [13, [H, [I3l) in order to obtain enhanced, 
less degenerated, characterizations of complex networks. 
Each of such measurements are defined for a series of 
hierarchical levels £, yielding a high dimensional mea- 
surement space. Actually, the values of such measure- 
ments taken at each level can be understood as a mea- 
surement in itself. Given such a large number of fea- 
tures, it becomes important to identify which measure- 
ments are potentially more effective in providing discrim- 



inative descriptions of specific types of networks under 
analysis. In this work, we applied standardization and 
canonical projections in order to identify the most im- 
portant measurements in Table |T] with respect to four 
representative complex networks models, namely Erdos- 
Renyi, Barabasi- Albert, Watts-Strogatz and a geograph- 
ical type of network. Each hierarchical measurement 
was calculated along three successive neighborhood lev- 
els. The traditional average degree, clustering coefficient 
and shortest path lengths were also considered. A total 
of 13 measurements were considered in our investigation. 

Several interesting findings have been obtained by 
the applied methodology. First, four types of networks 
were well-separated even in the two-dimensional canon- 
ical projected phase space (considerably worse separa- 
tions were obtained by PC A), with the pairs of models 
ER/BA and WS/GG forming superclusters. By taking 
into account the respective weights of each measurement 
in the canonical projections, it was possible to associate 
an overall importance value to each measurement. Such 
values were calculated for three network sizes {N = 100, 
200 and 300). While the traditional clustering coeffi- 
cient was identified as contributing more intensely for 
the separation between the network types, several hierar- 
chical measurements resulted in relatively high comple- 
mentary contributions, with the hierarchical clustering 
coefficient by balls {C^{£)) and neighborhood clustering 
coefficient (C^ {£)) providing particularly relevant con- 
tributions. The node degree (traditional and hierarchi- 
cal), as well as the average shortest path length, did not 
contribute significantly to the separation of the network 
models. It is important to recall that such results are, in 
principle, specific to the separation of the four considered 
types of networks, in the sense that different results may 
be obtained when considering other networks models. 

In addition to providing an objective means for se- 
lecting measurements for characterization and discrimi- 
nation of complex networks models, the multivariate ap- 
proach considered in this work can also provide valuable 
insights about the structural differences between distinct 
types of networks. For instance, the fact that the clus- 
tering coefficient resulted more relevant than the short- 
est path length suggests that the four considered mod- 
els present local connectivity (expressed in the clustering 
coefficient) even more distinct than shortest path length 
distribution. It would be interesting to apply such mul- 
tivariate methods to the characterization of other types 
of networks, especially those involving community struc- 
ture. 
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